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MAP kinase phosphatase mutant 

The present invention relates to DNA encoding proteins contributing to the regulation of a 
plant's response to abiotic stress and in particular genotoxic stress. 

Cells of all organisms have evolved a series of DNA repair pathways which counteract the 
deleterious effects of DNA damage and are triggered by intricate signal cascades. To be 
able to modify or improve DNA repair using gene technology it is necessary to identify key 
proteins involved in said pathways or cascades. Therefore it is the main object of the 
present invention to provide DNA comprising an open reading frame encoding such a key 
protein. 

DNA according to the present invention comprises an open reading frame encoding a 
protein characterized by a stretch of amino acids or component amino acid sequence 
having 40% or more identity with an aligned component sequence of SEQ ID NO: 3. The 
protein characterized by SEQ ID NO: 3 is tracked down with the help of a T-DNA tagged 
Arabidopsis mutant showing hypersensitivity to methyl methanesulfonate (MMS). Said 
hypersensitivity as well as an observed hypersensitivity to other DNA damaging treatments 
such as UV light is indicative of the proteins' involvement in the repair of DNA damage, or in 
signaling pathways implicated in the response to similar genotoxic stress. The mutant is 
also sensitive to elevated temperature and anti-oxidant N-acetylcysteine. The mutant is not 
sensitive to osmotic shock, increased salinity, oxidative stress or elevated ehtylene levels. 
An important characteristic of the mutant is cell death in response to growth in small closed 
vessels. This phenotype can be complemented by addition of abscisic acid (ABA) to the 
growth media. Furthermore, the mutant is more sensitive to exogenously applied ABA 
compared with the wild type which supports the notion that the genes disclosed by the 
present invention (SEQ ID NO: 1) are involved in stress signaling mediated by ABA. 

Sequence alignments of SEQ ID NO: 3 using commercially available computer programs 
such as BLASTP of the NCBI BLAST family of programs or TFASTA or BestFit of the 
Wisconsin Package Software, all based on well known algorithms for sequence identity or 
similarity searches, reveal that stretches of SEQ ID NO: 3 (component sequences) having 
more than 100 and preferably between 120 to 250 amino acids length can show between 
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20% and almost 40% sequence identity to aligned stretches of known phosphatases, 
particularly phosphotyrosine phosphatases, MAP kinase phosphatases or dual specificity 
phosphatases. Protein phosphatases are classified by their substrate specificities as either 
phosphoserine/threonine phosphatases (PSTPs) or phosphotyrosine phosphatases (PTPs). 
The dual specificity phosphatases (DSPs) dephosphorylate both phosphotyrosine and 
phosphoserine/threonine residues and represent a subfamily of PTPs. MAP kinase 
phosphatases (MKPs) belong to the family of DSPs. The sequence VHCCQGVSRS (SEQ 
ID NO: 4) found in SEQ ID NO: 3 can be interpreted as corresponding to the mammalian 
sequence motif IHCXAGXXRS (SEQ ID NO: 5) defining the family of PTPs, wherein the He 
at the first position can be replaced by Val and the Ser at the last position can be replaced 
by Thr 

The present invention defines a new protein family the members of which are characterized 
by component amino acid sequences of more than 100 amino acid length showing 40% or 
higher amino acid sequence identity to aligned component sequences of SEQ ID NO: 3. 
Preferably said component sequences are of more than 120, more than 160 or even more 
than 200 amino acids length. The amino acid sequence identity is preferably higher than 
50% or even higher than 55%. Most preferred are identities higher than 70%. 

An example of DNA according to the present invention is described in SEQ ID NO: 1. The 
amino acid sequence of the protein encoded is identical to SEQ ID NO: 3. After alignment a 
stretch of the protein having about 140 amino acids shows 36% sequence identity to the 
MKP-1 protein described by Sun et al (Cell 75: 487-493, 1993). The identity determined 
after alignment with MKP-2 and MKP-3 is determined as 34% and 26%, respectively. Thus, 
according to the present invention a protein family related to MAP kinase phosphatases can 
be defined the members of which after alignment of a stretch of more than 100 amino acids 
length show 40% or higher amino acid sequence identity to SEQ ID NO: 3. Preferably, the 
amino acid sequence identity is higher than 50% or even higher than 55%. When making 
multiple sequence alignments, certain algorithms can take into account sequence 
: similarities, such as same net charge or comparable hydrophobicity/hydrophilicity of the 
individual amino acids, in addition to sequence identities. The resulting values of sequence 
similarities, as compared to sequence identities, can help to assign a protein to the correct 
protein family in border-line cases. Proteins of particular interest, within the scope of the 
present invention, are MAP kinase phosphatases the amino acid s quence of which 
comprises at least one of the following characteristic amino acid subsequences: 
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(a) TSILYDVFDYFEDV 

(b) FVHCCQGVSRST 
(C) FVHC 

(d) QGVSRS 

(e) YFKSD 



(SEQ ID NO: 6) 
(SEQ ID NO: 7) 
(SEQ ID NO: 8) 
(SEQ ID NO: 9) 
(SEQ ID NO: 10) 



DNA encoding proteins belonging to the new protein family according to the present 
invention can be isolated from monocotyledonous and dicotyledonous plants. Preferred 
sources are corn, sugar beet, sunflower, winter oilseed rape, soybean, cotton, wheat, rice, 
potato, broccoli, cauliflower, cabbage, cucumber, sweet corn, daikon, garden beans, 
lettuce, melon, pepper, squash, tomato, or watermelon. However, they can also be isolated 
from mammalian sources such as mouse or human tissues. The following general method, 
can be used, which the person skilled in the art will normally adapt to his specific task. A 
single stranded fragment of SEQ ID NO: 1 or SEQ ID NO: 2 consisting of at least 15, 
preferably 20 to 30 or even more than 100 consecutive nucleotides is used as a probe to 
screen a DNA library for clones hybridizing to said fragment. The factors to be observed for 
hybridization are described in Sambrook et al, Molecular cloning: A laboratory manual, Cold 
Spring Harbor Laboratory Press, chapters 9.47-9.57 and 11.45-11.49, 1989. Hybridizing 
clones are sequenced and DNA of clones comprising a complete coding region encoding a 
protein with more than 40% sequence identity to SEQ ID NO: 3 is purified. Said DNA can 
then be further processed by a number of routine recombinant DNA techniques such as 
restriction enzyme digestion, ligation, or polymerase chain reaction analysis. 

The disclosure of SEQ ID NO: 1 enables a person skilled in the art to design 
oligonucleotides for polymerase chain reactions which attempt to amplify DNA fragments 
from templates comprising a sequence of nucleotides characterized by any continuous 
sequence of 15 and preferably 20 to 30 or more basepairs in SEQ ID NO:1. Said 
nucleotides comprise a sequence of nucleotides which represents 15 and preferably 20 to 
30 or more basepairs of SEQ ID NO: 1 . Polymerase chaiin reactions performed using at 
least one such oligonucleotide and their amplification products constitute another 
embodiment of the present invention. 
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Knowing the nucleotide sequence of the Arabidopsis MKP1 gene and the amino acid 
sequence of the encoded protein it is possible to identify proteins interacting with AtMKPI 
and to clone their corresponding genes using well known techniques. For example 
radioactively labeled AtMKPI protein can be used for interactive cloning on cDNA 
expression libraries. AtMKPI protein or parts thereof can be used to generate polyclonal or 
monoclonal antibodies specific for AtMKPI . The AtMKPI gene can be used to generate 
variants of AtMKPI protein tagged with GST, MYK or His. Said antibodies and MKP1 
variants allow to isolate native protein complexes by immunoprecipitation and to determine 
sequences of proteins present in these complexes by micro-sequencing. The resulting 
sequence information can in turn be used to clone corresponding genes. Alternatively, said 
antibodies or tagged MKP1 variants can be used to screen epitope libraries for epitopes 
which interact with AtMKPI protein. The AtMKPI protein and parts thereof, in particular the 
N-terminal 490 amino acid region and the C-terminal 492 amino acid region can also be 
used to search for interacting proteins with a Two-hybrid system (e.g. in yeast, in 
mammalian cells, or in bacteria). This allows to obtain sequence information about 
interacting proteins. 

Based on the disclosed finding that AtMKPI proteins are involved in a plants abiotic 
environmental stress response, it becomes possible to engineer the corresponding 
signaling pathway, of which AtMKPI is a part, to be chemically regulated due to chemical 
activation or repression of transgenes encoding AtMKPI or proteins interacting therewith. 
Such plants can be obtained by transformation with the corresponding genes under control 
of chemically inducible promoters. Application of inducers is expected to modify the activity 
of the AtMKPI signaling pathway and to result in altered adaptation to abiotic environmental 
stress. Alternatively, AtMKPI protein or its interacting proteins can be used as targets for 
chemicals inhibiting or stimulating their activities which again is expected to modify abiotic 
stress responses. 

EXAMPLES : 

Example 1 : Cloning of the gene responsible for the mkpl mutant phenotype 

Arabidopsis T-DNA insertion lines as produced by the INRA-Versaiiies and available from 
the Nottingham Arabidopsis Stock Center (NASC) are screened for sensitivity to methyl 
methanesulfonate (MMS) at a concentration of 100 ppm as described by Masson et al, 
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Genetics 146: 401-407, 1997. Plants which die in the presence of 100 ppm MMS are found 
in the family AAN4. Thus, the corresponding T-DNA insertion mutation is assumed to give 
rise to this hypersensitive phenotype. This assumption is supported by genetic analysis 
showing co-segregation of the hypersensitive phenotype with the T-DNA insertion. 

Genomic DNA from the mutant plants is isolated as described by Dellaporta et al, Plant Mol 
Biol Reporter 1 : 19-21, 1983. A fragment of genomic DNA flanking the right border of the 
inserted T-DNA is rescued essentially according to Bouchez et al, Plant Mol Biol Reporter 
14: 1 15-123, 1996, with minor modifications. Genomic DNA is digested with Pstl, ethanol 
precipitated and resuspended in H 2 0. DNA of vector pResc38 (Bouchez et al supra) is 
digested with Pstl and dephosphorylated with shrimp alkaline phosphatase. The 
phosphatase is heat inactivated, the vector DNA is ethanol precipitated and resuspended in 
H 2 0. 2.5 |ig of Pstl digested genomic DNA and 2.5 jig of Pstl digested and 
dephosphorylated vector are mixed and ligated overnight at room temperature in a total 
volume 100 \i\ in the presence of 10 units of T4 DNA ligase. The DNA of the ligation mixture 
is precipitated with ethanol, resuspended in 50 |il H 2 0, and digested with Xbal in a total 
volume of 100 \il Xbal digested DNA is precipitated with ethanol and resuspended in H 2 0. 
A second overnight ligation reaction in the presence of 10 units T4 DNA ligase is performed 
in a total volume of 200 nl at room temperature to achieve circularization of DNA fragments. 
The DNA of the ligation mixture is again precipitated with ethanol, rinsed two times with 
70% ethanol, dried and dissolved in 5 ^il H 2 0. Two 2 \x\ aliquots are used for electroporation 
of electrocompetent E.coli XL1-Blue cells (Stratagene) according to the manufacturer's 
instructions. Transformants containing the inserted T-DNA and adjacent Arabidopsis 
genomic DNA sequences are selected on plates with 50mg/l kanamycin. Single bacterial 
colonies are analyzed by isolation of plasmid DNA using QIAprep Spin Plasmid Kit (Qiagen) 
and restriction digestion with Pstl and Xbal. Plasmid pBN1 containing 3.7 kb of inserted 
T-DNA linked to 5 kb of Arabidopsis DNA is identified. Sequencing of the junction site is 
performed using a primer directed towards the flanking plant DNA and having the 
nucleotide sequence 5 ' -ggtttctacaggacgtaacat-3 ' (SEQ ID NO: 14) complementary 
to T-DNA 41 nucleotides from the right border. Digestion of this clone with Sstl allows 
isolation of a 960 bp fragment which when labelled with 32 P can be conveniently used as a 
probe to screen wild type Arabidopsis genomic and cDNA libraries in order to identify the 
wild type gene affected in the mkp1 mutant line. 
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Example 2: Cloning of the AtMKPI wild-type gene 

The 960 bp Sstl fragment mentioned at the end of example 1 is labeled with 32 P by random 
oligonucleotide-primed synthesis (Feinberg et al, Anal Biochem 132: 6-13, 1983) for use as 
a probe in the following hybridization experiments. 

Southern blot analysis of Arabidopsis wild type and mkp 1 DNA digested with EcoRV 
confirms that in the mkp1 genomic DNA the sequence hybridizing to the probe is linked to 
T-DNA. 

Northern blot analysis of Arabidopsis wild type RNA reveals the presence of a hybridizing 
transcript in RNA extracted from seven-day-old wild type seedlings. No such hybridizing 
fragment is detected in the corresponding RNA of mkpl seedlings. 
A cDNA library (Elledge et al, 1 991 ) and a genomic library (Stratagene) of wild type 
Arabidopsis thaliana ecotype Columbia is screened with the labelled Sstl fragment 
mentioned above. Screening of the bacteriophage X libraries is performed according to the 
protocols described in chapter 6 of Ausubel et al, 1994, "Current protocols in molecular 
biology", John Wiley & Sons, Inc. Hybridization is performed as described by Church and 
Gilbert, Proc Natl Acad Sci USA 81: 1991-1995, 1984. Bacteriophage clones hybridizing to 
Sstl fragment are subjected to in vivo excision of plasmids according to Elledge et al, Proc 
Natl Acad Sci USA 88: 1731-1735, 1991, and Stratagene protocols. Inserts of the plasmids 
obtained are further analyzed by sequencing. 

By partial sequencing and alignment of ten overlapping clones (pBN5.1 to pBN5.10) 
isolated from the genomic library a continuous genomic sequence of 6356 bp (see SEQ ID 
NO: 1 ) is decoded. 

Ten cDNA clones representing the same gene, one of them a 3.0 kb full-length cDNA (SEQ 
ID NO: 2), are isolated from the cDNA library. 

Example 3: Sequence Analysis and Alignments 

The 3 kb full-length cDNA clone of SEQ ID NO: 2 encodes an ORF with the start codon 
being defined by basepairs 298-300 and the stop codon by basepairs 2650-2652. The ORF 
encodes a protein consisting of 784 amino acids (SEQ ID NO: 3) and a predicted molecular 
mass of 86.0 kD. Alignment with the genomic sequence of SEQ ID NO: 1 reveals three 
introns. T-DNA is inserted within the coding sequence of the mkpl mutant DNA before 
basepair position 502 according to the numbering of SEQ ID NO: 2. The sub-sequence 
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VHCCQGVSRS (SEQ ID NO: 4) found in SEQ ID NO: 3 can be interpreted as 
corresponding to the mammalian sequence motif IHCXAGXXRS (SEQ ID NO: 5) defining 
the family of protein tyrosine phosphatases, wherein the He at the first position can be 
replaced by Val and the Ser at the last position can be replaced by Thr (Van Vactor et al, 
Curr Opin Gen Dev 8: 1 12-126, 1998). Therefore it is concluded that the wild type ORF 
encodes a protein tyrosine phosphatase with invariant aspartic acid, cysteine, and arginine 
residues (Fauman et al, Trends Biochem Sci 21: 413-417, 1996) in positions 204, 235 and 
241 according to SEQ ID NO: 3. 

A data base search using the TFASTA program (Wisconsin Package Version 9.1, Genetics 
Computer Group (GCG), Madison, Wise.) reveals that the encoded phosphatase has a 
significant similarity to dual specificity phosphatases. The closest homologue identified is 
Xenopus laevis MAP kinase phosphatase (MKP; Lewis et al, J Cell Sci 108: 2885-2896, 
1995) showing 38.1% identity and 52.5% similarity in a 140 amino acid overlap region. The 
deduced AtMKPI protein also has 36.0% identity and 52.5% similarity with a 140 amino 
acid overlap region encoded by the rat 3CH134/CL100 cDNA representing a rat MKP1. 
Essentially identical results are obtained when using the BLASTP 2.0.4 (Feb-24-1998) 
program of the NCBI BLAST family of programs which, allowing gapped alignment, 
compares an amino acid query sequence against a protein sequence database (Altschul et 
al, Nucleic Acids Res. 25: 3389-3402, 1997). No higher plant homologues are identified. 
The genomic position of the AtMKPI gene is determined by hybridization to filters 
containing genomic YAC clones publicly available from the Arabidopsis Biological Resource 
Center (Ohio, USA). The AtMKPI gene is found to map to chromosome 3 between markers 
ve022 and BGL1 . 

Example 4: Complementation 

mkpl mutant plants are transformed with DNA comprising the corresponding wild type 
genomic DNA including promoter and polyadenylation signal to find out whether the cloned 
wild type gene is able to complement the mutant mkpl phenotype. 
mkpl mutant plants harbor T-DNA containing the NPTII and bar marker genes under the 
control of nos and CaMV35S promoters, respectively. Therefore, different marker genes are 
used for the transformation construct. The vector used is a derivative of pl'barbi which is 
highly efficient in Arabidopsis transformation (Mengiste et al, Plant J 12: 945-948, 1997). In 
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pVbarbi the EcoRI fragment containing Vpromoter, bar gene coding region, and CaMV 35S 
polyadenylation signal is inverted in respect to the T-DNA borders by EcoRI digestion and 
re-ligation. In the resulting plasmid the Vpromoter (Velten et al, EMBO J 3: 2723-2730, 
1984) is directed towards the right border of the T-DNA. This plasmid is digested with 
BamHI and Nhel, and the bar gene and CaMV 35S polyadenylation signal are replaced by a 
synthetic polylinker with the sites for the restriction enzymes BamHI, Hpal, Clal, Stul and 
Nhel. The resulting plasmid is digested with BamHI and Hpal and ligated to a BamHI-Pvull 
fragment of pROB1 (Bilang et al, Gene 100: 247-250, 1991) containing the hygromycin-B- 
resistance gene hph linked to the CaMV 35S polyadenylation signal. The T-DNA of the 
resulting binary vector pl'hygi contains the hygromycin resistance selectable marker gene 
under the control of the Vpromoter and unique cloning sites for the restriction enzymes 
Clal, Stul and Nhel located between the marker gene and the T-DNA right border. 
pVhygi is used to insert the reconstructed AtMKPI gene as follows. Plasmid pBN1 of 
example 1 is digested with Pstl and Muni and dephosphorylated. The restriction fragment 
containing the 3'portion of the AtMKPI gene and pBIuescript-SK(+) is purified from the 
agarose gel and ligated to the Pstl-Munl restriction fragment of the wild type genomic clone 
pBN5.2 (example 2) including the 5' end of the coding sequence of the AtMKPI gene and 
2.4 kb of upstream sequences. The reconstructed AtMKPI gene is excised by Pstl and A/of/ 
and after filling the ends is inserted into the Stul site of pl'hygi. The construct is introduced 
by transformation into Agrobacterium tumefaciens strain C58CIRif R containing the non- 
oncogenic Ti plasmid pGV3101 (Van Larebeke et al, Nature 252: 169-170, 1974). T-DNA 
containing the reconstructed AtMKPI gene is transferred to mutant plants by the method of 
in planta Agrobacterium mediated gene transfer (Bechtold et al f C R Acad Sci Paris, Life Sci 
316: 1194-1199, 1993). Seeds of infiltrated plants are grown on hygromycin-containing 
medium to screen for transformants. The progeny of selfed hygromycin resistant plants is 
analyzed for the segregation of hygromycin resistance. The families in which a 3:1 
segregation ratio is observed are used to isolate homozygous lines bearing the newly 
introduced T-DNA inserted at a single genetic locus. The obtained hygromycin resistant 
Jines are analyzed by Northern blot analysis for the restoration of AtMKPI expression. In 
these lines the restoration of transcription of the AtMKPI gene can be observed as well as 
the restoration of the wild typ level of MMS resistance and ABA mediated stress 
responses. Complementation is not observed in plants transformed with p1 'hygi only. 
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Example 5: Cloning of homologous sequences from other plant species 

Use of AtMKPI cDNA as a probe for Southern hybridization with genomic DNA from other 
plant species such as Sinapis alba (mustard), Lycopersicum esculentum (tomato) and Zea 
mays (maize) is successful in the case of Sinapis alba which belongs to the same family as 
Arabidopsis (Brassicaceae). 

Homologous sequences from the other species can be identified in a PCR approach using 
degenerate primers 1-3 below, wherein I is inosine, derived from the regions conserved 
between VH-PTP13 of Clamydomonas eugametos and AtMKPI protein: 

Primer 1 (forward): 5 ' -aay aay ggi ath aci cay ath yt-3' (SEQ ID NO: 11); 
Primer 2 (reverse): 5 ' -ytg rca igc raa icc cat rtt igg-3' (SEQ ID NO: 12); 
Primer 3 (reverse): 5'-igt cca cat iar rta igc dat iac (SEQ ID NO: 13); 

A PCR reaction is performed in a total volume of 50 |il containing 1x reaction buffer 
(Qiagen), 200 (iM of each dNTP, 1.25 units of Taq polymerase (Qiagen), and 100 pmol of 
each primer. 

Reaction 1 is performed with primers 1 and 2 using genomic DNA from Sinapis alba 
(200 ng), Lycopersicum esculentum (400 ng), or Zea mays (600 ng) as the original template 
DNA. Amplification is carried out after an initial denaturation step of 3„min at 94°C, followed 
by 30 cycles of 30 sec at 94°C, 30 sec at 40°C, and 3 min at 72°C. The resulting 
amplification mixture is diluted 10 3 fold. 

Reaction 2 is performed using 2\jd of the above dilution to provide the necessary template 
DNA. This time primers 1 and 3 are used under the same conditions as specified for 
reaction 1 . The resulting amplification products are cloned into the T/A vector pCR2.1 
(Inyitrogen) and further analyzed by nucleotide sequencing. 

Using this PGR approach it is possible to amplify sequences homologous to the AtMKPI 
gene from all the species mentioned above. Whereas the nucleotide sequence from Sinapis 
alba SaMKPI ((SEQ ID NO: 15 encoding SEQ ID NO: 16) is 90.8% identical' to the 
AtMKPI sequence, the nucleotide sequence from Lycopersicum esculentum LeMKPI 
(SEQ ID NO: 17 encoding SEQ ID NO: 18) is 72.3% and the Zea mays sequence 
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ZmMKPI (SEQ ID NO: 19 encoding SEQ ID NO: 20) 71.8% identical. The fragments 
hybridize to genomic DNA from corresponding species under the usual hybridization 
conditions for Southern blot analysis. The fragments can be used as probes to screen 
cDNA libraries for corresponding cDNA sequences. 

The 243 bp ZmMKPI fragment amplifying from maize DNA is used as a probe to screen a 
maize cDNA library (Clontech) made in the Lambda ZAP®II Vector (Clontech) from 
"Blizzard" hybrid etiolated shoots, which were treated with the herbicide safener Benoxacor. 
The titer of the library is determined as 3x10 9 pfu/ml. 

Library screening is conducted as described in the Clontech Lambda Library Protocol 
Handbook, with some slight modifications. Briefly, a single colony of XL-1 Blue is picked 
and incubated overnight at 37°C in LB medium, containing 10mM MgSQ 4 and 0.2% 
maltose. 600 \i\ of stationary phase grown bacteria for each 150 mm plate is combined with 
100 |il of phage library dilution in sterile 1x lambda dilution buffer (100mM NaCI; 10mM 
MgS0 4 ; 35mM Tris-HCI, pH7.5) to yield approximately 30,000 pfu per plate. This mixture is 
incubated at 37°C for 15 minutes, subsequently 7 ml of melted LB soft top agarose (at 
48°C) is added to the cell suspension for each 150 mm plate, shortly mixed and then 
poured on two-day-old LB" 9804 agar plates, which have been pre-warmed to 37°C for four 
hours. The plates are then incubated at 37°C until plaques reach appropriate sizes (after 
about 8 to 9 hours). After chilling the plates at 4°C for one hour, phage particles are 
transferred to Hybond N nitrocellulose filters and the orientation of each filter to its plate is 
recorded with a waterproof pen. The filters are then treated by placing them on Whatman 
3MM paper saturated with the appropriate solution. The treatments include denaturation 
solution (0.5M NaOH; 1.5M NaCI) for 2 minutes, followed by neutralization solution (0.5M 
Tris-HCI, pH 7.2; 1 .5M NaCI; 1mM EDTA) for 3 minutes and 2x SSC for 3 minutes. DNA is 
subsequently crosslinked to the filters by UV. 

Filters are then pre-hybridized, hybridized with the radioactivelly labeled ZmMKPI fragment 
and washed as described in Sambrook et al, Molecular cloning: A laboratory manual, Cold 
Spring Harbor Laboratory Press, chapters 9.47-9.57 and 1 1 .45-1 1 .49, 1989. An agar plug 
from the position of a positive plaque is then removed from the master plate and incubated 
overnight at 4°C in 1 ml of 1x lambda dilution buffer, containing 20 fjJ of chloroform. Each 
titer is determined and the phages are re-plated to obtain approximately 200 to 500 plaques 
on a 150 mm plate for a secondary screen as described above. Single plaques of interest 
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are collected from the agar plates and incubated over night at 4°C in 500 fil of 1x lambda 
dilution buffer and 20 \il of chloroform. 

The pBluescript phagemid is excised from the XZAP™ vector as described by the In Vivo 
Excision Protocol using the ExAssist/SOLR System in the Stratagene Uni-ZAP™ XR Library 
Instruction Manual (1993). A 1/100 dilution is made of XL1-Blue MRF and SOLR overnight 
cultures (at 30°C) and incubated at 37°C for 2-3 hours. XL1-Blue MRF cells are then 
pelleted for 10 minutes at 1 ,500xg and re-suspended at an OD 60 o= 1 .0 in 10mM MgS0 4 . 
200 |llI of these XL1-Blue cells are then combined with 250 \i\ of phage stock and 1 |xl of 
ExAssist helper phage in a 50 ml conical tube and incubated at 37°C for 15 minutes. 3 ml of 
LB broth is added and incubated at 37°C for 5 hours, after which the cells are pelleted for 
1 5 minutes at 2,000xg and the supernatant transferred to a new tube. The tube is then 
heated at 70°C for 15 minutes and centrifuged again for 15 minutes at 4,000xg. The 
supernatant, containing the excised phagemid pBluescript packaged as filamentous phage 
particles, is decanted into a new tube. 10 and 100 |il of this phage stock are then added to 
two tubes with 200 |il of SOLR cells that have been allowed to grow to OD 6 oo= 0.5-1.0 
before being removed from the incubator and further incubated at room temperature. The 
tubes are incubated at 37°C for 15 minutes, followed by plating 10-50 fil from each tube on 
LB amp (50 M.g/ml) and over night incubation at 37°C. 

The positive clones are checked for insert size by EcoR\/Xho\ double digestion and end- 
sequencing with T3 and T7 promoter primers (Promega). 

Screening of 360,000 pf u of the library results in three identical clones of 2.2 kb containing 
the 3* poly(A) tail, but lacking part of the 5' end, including the translation initiation site. The 
gene corresponding to the identified partial cDNA clone is named ZmMKP2 , as it is not 
identical with the ZmMKPI fragment used as the probe (92.3% identity on the nucleotide 
level over the 1 96 bp fragment flanked by the primers 1 and 3). An additional 21 3 
nucleotides are amplified and cloned by 5* RACE (rapid amplification of cDNA ends) carried 
out following the instructions of the 573* RACE Kit (Boehringer Mannheim) resulting In a 
longer cDNA sequence of 2,452 bp but still not complete, judged by the predicted mRNA 
length from the RNA gel blot analysis and the absence of a possible translation initiation 
site. 

The sequence information gained from the ZmMKP2 cDNA including the additional 213 
nucleotides obtained by 5' RACE (SEQ ID NO: 21 encoding SEQ ID NO: 22) is used to 
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design two additional backward oriented degenerate primers wherein i is inosine to 3' 
regions conserved between the deduced peptide sequences of ZmMKP2 and AtMKPI: 

Primer 4 (reverse): 5'-gci gcy tti gcr tcy tty tcc-3' (SEQ ID NO: 25); 
Primer 5 (reverse): 5'~ytc ick igc igg iar rtg igt ytc-3' (SEQ ID NO: 26) 

These primers are used to PGR amplify a larger fragment of a MAP kinase phosphatase 
gene from tomato. The amplified and cloned 522 bp long fragment is not identical to 
LeMKPL Therefore, its corresponding gene is named LeMKP2 (SEQ ID NO: 23 encoding 
SEQ ID NO: 24; 75% identity on the nucleotide level over the stretch of 196 bp of 
ZmMKPI flanked by primers 1 and 3). The origins of all identified MAP kinase phosphatase 
homologous gene sequences are confirmed by Southern blot analysis. 

The following Table shows an alignment of a continuous stretch of 312 amino acids of 
AtMKPI with the related amino acid sequence of ZmMKP2. 



AtMKPI 


139 


KREKZAFFDKECSKVADHT^ 

--lllilllllililhl-lllllll.: ||: llllhlllllhll 
HKIX2IAFFDKTCSKVADHVYD 


188 


ZinMKP2 


24 


73 


AtMKPI 


189 


EYFKSDFCYRSLWLQDSPSEDITS ILYLWDY ir'i^VREQSGRIFVHCCQG 

MINI IMtllllMIIMIIIIIIIIIIIIIII ||: MINI 

ErjfFXSDLVYRTLWLQDSFTEDITS ILYIA/FDYFEI^VRB3GGRVLVHCCQG 


238 


ZmMKP2 


74 


123 


AtMKPI 


239 


VSRETTSLVIAYLMWRB^ 

MMIIMI 1 1 1 1 1 i 1 1 f r i i . | | | | i . f | | | } J | | | | j | | | | 

VSRCTSLVIAYUflWREQ^ 


288 


2rriMKP2 


124 


173 


AtMKPI 


289 


VHAFPI^PTSLLJ^KMSEHSPYDPIi^^ 

Ml MM l-lil|:|.| Illlhllll 1 -11111111: 

VHAIPI^FNSVLRMYFMAPHSQ 


338 


ZrrMKP2 


174 


223 


AtMKPI 


339 


LPSAIYIWVGRQCETIMEKDAKAA^ 


388 


ZnnMKP2 


224 


- l-hlll \ :|IMIIII \ llllh 1 MM! 1 : 
VI^SLYVWVC^CDF\rt4E^ 


273 


AtMKPI 


389 




428 


ZmMKP2 


274 


Mll-I- i 1 1 II II s.|. |||: M| 
VCIAFSSMPRtfSDSimaSKDQI^ 


323 


AtMKPI 


429 


GGFVPTLASSNNEHETHLPARE 450 




ZrrMKP2 


324 


II II ■ i llllllll- 

GGWPAFSTSGAGDE7IHLEARE 345 
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What is claimed is: 

1 . A DNA comprising an open reading frame encoding a protein characterized by a 
component amino acid sequence having 40% or more identity with an aligned 
component sequence of SEQ ID NO: 3 

2. The DNA according to claim 1 comprising an open reading frame encoding a plant MAP 
kinase phosphatase. 

3. The DNA according to claim 1 wherein the open reading frame encodes an amino acid 
sequence selected from the group of amino acid sequences described in SEQ ID NO: 
6, SEQ ID NO: 7, SEQ ID NO: 8 t SEQ ID NO: 9, and SEQ ID NO: 10. 

4. The DNA according to claim 1 , wherein the open reading frame encodes a protein 
characterized by the amino acid sequence of SEQ ID NO: 3 

5. The DNA according to claim 1 characterized by the nucleotide sequence of SEQ ID 
NO:1 

6. The DNA according to claim 1 wherein the open reading frame encodes a protein 
contributing to repair of DNA damage in a plant cell. 

7. The DNA according to claim 1 wherein the open reading frame encodes a protein 
conferring hypersensitivity to treatment with methyl methanesulfonate (MMS), 

8. The DNA according to claim 7 wherein the open reading frame encodes a protein 
conferring hypersensitivity to treatment with UV light or X-rays. 

9. The DNA according to claim 7 wherein the open reading frame encodes a protein 
interfering with abscisic acid signal transduction. 

10. The protein encoded by the open reading frame of any one of claims 1 to 9. 

11. A method of producing DNA according to claim 1 , comprising 

- screening a DNA library for clones which are capable of hybridizing to a fragment of 
the DNA defined by SEQ ID NO: 1 , wherein said fragment has a length "of ait least 15 
nucleotides; 

- sequencing hybridizing clones; 

- purifying vector DNA of clones comprising an open reading frame encoding a protein 
with more than 40% sequence identity to SEQ ID NO: 3 

- optionally further processing the purified DNA. 

12. A polymerase chain reaction wherein at least one oligonucleotide used comprises a 
sequence of nucleotides which represents 15 or more basepairs of SEQ ID NO: 1 . 
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SEQUENCE LISTING 



<110> Novartis AG 

<120> Organic compounds 

<130> Silencing Gene 

<140> 
<141> 

<160> 26 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 6356 
<212> ENA 

<213> Arabidopsis thaliana 
<400> 1 

gttcaaggtg gtgttatttn cagtattaga aaagaggctt ctagagagag tttctaaatt 60 
atattttgaa cacggccgga ttggttgtct tatgattaag tgaatgcttt agate tggtg 120 
acgattggta tatgagatat atagtanaat gatagaacat cagaaatact ataagtcacc 180 
atatttttaa aaaataaatg ctatagaatt tttgttttgg taattgttat aactctacaa 240 
agttatgtgt tatagagttt tactagtttc ategttatea tgtgtatagt ataaencaac 300 
aaaagaaatt ttaaatatct gaaaacataa aaatttaata aaatgatgtg agtataagaa 360 
aaaaagaaag aaagaaacaa cgtaaaaaat aaaaatcatt catacatata acaattttca 420 
aaagatcaat gttaacttta attagtcttt tttcttatgt ttcatgcaaa tgcaatagta 480 
ttacttttct ttaatctaag attatgtgtt gcttttaagc aagaactatt caaagagtca 540 
aagacatgea tgtaaacttt agagaaeggg atctttcatc aatcttattt ttttcccttt 600 
tttttttaca acgaacgata ggtagtaagc atgaatctgg ttcaaagttt ggatcagtgt 660 
gggcataatc ctggrttcgct aaaaagaatc aatataatat atgttgagcg taagagaaga 720 
aagctcatag ttctgtgtag aaaaaacgtg gagattggag agaaattact aagagagega 780 
agagaagagt aaacctagaa gatacaaaag aegctatcaa aagattgttt tatgtgttta 840 
tgaaacattc acttaggeca gtataaatat taaatgtcat tttgagaaag aaaaaaaaat 900 
ggtgtcattt: tgaggatata aatattttct caacaagatg ttttagactt tcaacataac 960 
gatcatttaa aactacataa tttatctctt cgttaaaaac ttaacattaa caaaaagtta 1020 
aattcgacaa gaagagttta ttcaataagt atactgtatg tgtttatttt attcacatga 1080 
aaagtgtcaa acaacacttg ggatacatat ttcatgecta aaaategtat aacaacattt 1140 
aaggtaatca gtaaatttga tgectaaact aaaatatgtg accattttga ccaatgagca 1200 
ttttttggaa ataaaactac tgeatacgae ctggtcaaac acatgaactg ccactgcttt 1260 
taaaactacc tttttaaact tttgaegtec atttttattt actaattatt atgttaatca 1320 
aaccactaac attatgattc gaaatttcga gtgatgattg ttagaagtga tgtgaatgat 1380 
gcggtataaa acaaattgtg atatatattc actcatatat atcaaaatca aaatagctta 1440 
tcgttccaaa acatgattga taaaatgtaa ttactattca aaacttaaag ccagtagtta 1500 
aaataaaaca aaataaatag tatatgtttc atattaatgc ccaaccaata ttttgttttg 1560 
atgaggaaaa gtcttttttt ttttttcttt ttttttctca tagttaaaag acataaaaaa 1620 
aattaatatt acaaacaaga caaaaaaaaa gaataaataa aacaacaatc attactgtca 1680 
aatcaattat cagagggaaa agttattaag aaatgtcaac caatgagtat cattatcatc 1740 
atgctttttt agacccttct ttttaattca tcaagattta gtcttgttta taaatttgaa 1800 
gtaaaattct aaatgaatca attctacaat tttttcccta cattattgta acgataaaat 1860 
ttaagatcaa caattacttc gttaattttt tttctgataa aaatatttga tatctttctg 1920 
attatzgatat attagcattg tttttgtatt gtatgtatgg taattaattt tagttcaaaa 1980 
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gaataataaa tggtttgatt agcatgaatt 
gtaataaact ttgtttcttt tggtaaatgt 
attactataa ttacattcaa tgtcagaatt 
agaaaaacaa gaatttatta tctttgacaa 
trtcaaagtt tcaacgcgtt tcttataata 
ataattatca taattaaatt aatcgttact 
cgaatttatt ttattgtaat ttgtttaata 
atctctcttt tccttttgtg ttcttcttct 
ctctctctct gaagaaaaat aaataaaaga 
cactccgtaa gttccaaatc tctctccttt 
ccaggagaat tgcttcaccc cctttttggg 
ctgtttccgt cttctagatt ctgggttcag 
ttcattttgt gtttcggagt tattcatggt 
gcgttttctt cttttctagt ttggctatgt 
ttccctctgt tttggtttct tttagggttt 
ggttattgtt agatcagatg tggatttaaa 
aagcaaagac ccttgtttta cactgtatgt 
aaggtgaagt ctttgagttt gtaattttga 
cgatggggaa tgatgaagct cctcctggtt 
ggtctgcttc acggactgca tcacaagttc 
cgtgtgctat tagttctggg ccgagtcgaa 
cacatcataa cagcaaggct agagcttgtt 
ggaggagctt agacgagtgg cctaaggcgg 
caccaacacc tagcgggaac aaaaccgggg 
agcagcgggr aacagataag agctctggtc 
ataaagaatg ttcaaaggtt gctgatcata 
acaagagcat actgaaaaac aatggaatca 
gtccggaata tttcaagtct gatttttgtt 
cagaggatat agctagtatt ctgtacgatg 
aaagtggaag ganctttgtt cattgttgtc 
tagcatatct gatgtggaga gaagggcaaa 
ctgctagagg tattgctgat cctaacatgg 
agagggttca tgcgttcccg cttagcccta 
cacactctcc ttacgaccct tztgcatcttg 
gttctccgga ttcaagaggt gcatttatca 
ttggtaggca gtgrgaaacc atcatggaga 
ctcgatatga . gaaagtcgaa gcacctatta 
attactggga tgcacttgca agcattttgc 
aaccaggtga caggaaggtc gacgcatata 
tagagggagg ttttgtgcca actttagcat 
ctgcaaggga aaacagttgg agctcactga 
gttttcggra . tgtctccaaa acgccactat 
tgcacccatc aggctcacct tcctcaacaa 
ttctctctcc ogantccgta tgttcaacaa 
aatcccctgg. acgttcgtcc ttgagacctt 
ttcccagcct- atccctcctc ccttcccaaa 
cttttcccca accgtcacca aatagaaagg 
gcctgaaagg atccctgaag ttgccaggtt 
ttacttcaca tccggatgat agtaatgaca 
gcgatctaca cccaccaagt gattgcaaag 
aggaaatcar atccttaatc agttgcagta 
ctgatagccc cggccagcct ttagcatgtc 
tgagcagagc ttacttagat tcagaatctg 
taggagaaac gggtagtagg aatttgtaca 
acaactgttc tctagtagat agcaacaaag 
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taataaaaaa ataagactga ataaacatag 2040 
aaaattagaa aaaatcataa tcccaaggta 2100 
aaacgtagtg aataaaaacc gtaaaacatg 2160 
gcaataaatg aaatgctgac aaaaaattgg 2220 
agaattcaat ttcgtgcagc taatcaggag 2280 
actttataat actcccaaaa taatcgaaaa 2340 
ataaagaatt actgtttcct cccacttccc 2400 
tctccgcttt gtttccccaa tctctctctt 2460 
tctaactttg acggctctct taatcttact 2520 
actctcatat ctatatcgtc cgaacaaaac 2580 
tttttaatca ttttctcaga ttctcagttt 2640 
tttctgtttt gctcttattg aattttctta 2700 
agctgaattt gttaattctg atgttgtttt 2760 
cgtctttgat ctgatgctgg gttattctct 2820 
taagtcggaa tagactgatg ggagcttgat 2880 
gccttcgctg aactaacaag tctatggaag 2940 
tgtgaggaat ttgtctgatt ttgggtgata 3000 
gataagattg gatggtggga agagaggatg 3060 
ctaagaaaat gttttggcgg tctgcctctt 3120 
ctgagggtga tgagcaaagc ctgaacattc 3180 
gatgtccagc tgctcctttg acacctcgtt 3240 
tgccaccatt gcagcctctt gccatttcta 3300 
gttcggatga tgtcggtgag tggcctcatc 3360 - 
agagattgaa gctcgattta tcatcaacgc 3420 
tagctaagag ggagaagatt gctttctttg 3480 
tatatgtggg tggagatgct gtggcgaaag 3540 
cgcatatctt gaactgcgtt ggttttatct 3600 
acagatcctt gtggttacag gatagtccgt 3660 
tgtttgacta ctttgaagat gtgagggagc 3720 
aaggggtttc acgatctacc tcgttggtaa 3780 
gttttgatga tgcatttcag tatgtgaagt 3840 
gctttgcttg ccaattgtta caatgccaaa 3900 
cctccttact tagaatgtac aaaatgtctc 3960 
ttccaaaact gttgaatgat ccatgcccga 4020 
ttcagttacc ttctgcaatt tacatttggg 4080 
aagatgcaaa agctgctgtt tgtcagattg 4140 
tggtggtcag agaaggtgat gagcctgttt 4200 
ctatgattgg gggctcggta attaaagttc 4260 
atctggattt tgagattttt cagaaagcca 4320 
catccaacaa cgaacatgag actcatcttc 4380 
aatgtaagtt tgcatcaagg tttgacaaag -4440 
ctagggtcta ttcagattcg atgatgatcg 4500 
cttcttcatc atccactgcg tcgcctcctt 4560 
attcaggcaa tagcttaaag agtttctctc 4620 
ctattccacc atcgctgaca ttgcctaaat 4680 
cttctcctaa agaatctcgt ggtgtcaata 4740 . 
cttcaccttc tcttgctgag cgtagaggca 4800 
tggctgattc caacagaggc acacctgctt 4860 
tagtcttcaa tctggagggt attagaaacg 4920 
ggacaagtgt agattcagat ttgccagaga 4980 
aatctgacag acacaaatcg ggaggtgata 5040 . 
gttggccaag tatggagatg attacaaaac 5100 
ttatagcaat cccgttgcca agtgatgctg 5160 
tttggatcgg aaagtcattc tctttggata 5220 
cggcagacac tgtggagaat gttgattggg 5280 
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tacaaattgg tgaatccatt ttgtgtcaga 
taataatagc ctaaactttg gaggctctga 
tcatctttgt catgtcttat ccaaccaaac 
gcactttcat tactgactac tgaggacggt 
tatgtattgg tttattctct agtggtttgc 
gtttgtaact gataagttgc ttttctttct 
cagacagagt tgctagcact gctgagcgcg 
ttactctgtt tttttttcac agattccttc 
gtattcattc agctcaggtg agaattctct 
gttgcatccg aatacaatcg ttagttctgg 
ttgagttaga ggatggttgg aacaaaaaaa 
aattctcata gaccatattt gattcttttg 
tatggaagtt gatgttgggg gaaacaactc 
cttctttcta tcacatgaaa atcctcaagg 
tgtaaggtta ggggctttat gaggattttg 
tcagcagcac aatagatttt tgttaaactt 
atgttgaagt. ctaattgaag aattagrttaa 
atccaatttt ggttttgcgc atagtttctc 

<210> 2 
<211> 3059 
<212> EfcOA 

<213> Arabidopsis thaliana 



tggacttgcc aaaagatacc cctataaagg 5340 
tactttttac taattgtaaa gtctgcgtgc 5400 
tatatttcga agatgaaaat tacaatctca 5460 
taggtagaat ccttatgatt tcagcagttg 5520 
atggttccaa cttgttatga tccttttgtt 5580 
tgttaacaga tagttaggga atctgaggat 5640 
ctataacacc cacccgcaag ctctacacat 5700 
aaccgcaaca cttttccatt ttcagacaga 5760 
gaaagcagtzc tgtaacactt catcttcaca 5820 
attatgttta attgctatct gatcatgaat 5880 
cttagaagct cgaatgaccg gtttttacca 5940 
atttacttct ggtgcaggac tctctgtgct 6000 
tcttgtacag tggggaaaaa acttcttctt 6060 
gccattatta gtatgatcag attataaaat 6120 
atggacttgt tacaatgttt acatatacac 6180 
acatgttatt caagtaaaag tactatgtag 6240 
tgatagtctt aaacacttga ttcacttgtc 6300 
ttcttttatt tcctctctaa aacacg 6356 



<400> 2 

cttcttctcc gctttgtttc cccaatctct 
aaagatctaa ctttgacggc tctcttaatc 
ctgatgggag cttgatggtt attgttagat 
aacaagtcta tggaagaagc aaagaccctt 
ctgattttgg gtgataaagg tgaagtcttt 
gtgggaagag aggatgcgat ggggaatgat 
tggcggtctg cctcttggtc tgcttcacgg 
caaagcctga acattccgtg tgctattagt 
cctttgacac ctcgttcaca tcataacagc 
cctcttgcca tttctaggag gagcttagac 
ggtgagtggc ctcatccacc aacacctagc 
gatttatcat caacgcagca gcgggtaaca 
aagattgctt tctttgataa agaatgttca 
gatgctgtgg cgaaagacaa gagcatactg 
tgcgttggtt ttatctgtcc ggaatatttc 
ttacaggata gtccgtcaga ggatataact 
gaagatgtga gggagcaaag tggaaggatc 
tctacctcgt tggtaatagc atatctgatg 
tttcagtatg tgaagtctgc tagaggtatt 
ttgttacaat gccaaaagag ggttcatgcg 
atgtacaaaa tgtctccaca ctctccttat 
aatzgatccat gcccgggttc tctggattca 
gcaatttaca tttgggttgg taggcagtgt 
gctgtttgtc agattgctcg atatgagaaa 
ggtgatgagc ctgtttatta ctgggatgca 
tcggtaatta aagttcaacc aggtgacagg 
atttttcaga aagccataga gggaggtttt 
catgagactc atcttcctgc aagggaaaac 
tcaaggtttg acaaaggttt tcgcrtatgtc 
gattcgatga tgatcgtgca ttcaccaggc 



ctcttctctc tctctgaaga aaaataaata 60 
ttactcactc cggttttaag tcggaataga 120 
cagatgtgga tttaaagcct tcgctgaact 180 
gttttacact gtatgttgtg aggaatttgt 240 
gagtttgtaa ttttgagata agattggatg 300 
gaagctcctc ctggttctaa gaaaatgttt 360 
actgcatcac aagttcctga gggtgatgag 420 
tctgggccga gtcgaagatg tccagctgct 480 
aaggctagag cttgtttgcc accattgcag 540 
gagtggccta aggcgggttc ggatgatgtc 600 
gggaacaaaa ccggggagag attgaagctc 660 
gataagagct ctggtctagc taagagggag 720 
aaggttgctg atcatatata tgtgggtgga 780 
aaaaacaatg gaatcacgca tatcttgaac 840 
aagtctgatt tttgttacag atccttgtgg 900 
agtattctgt acgatgtgtt tgactacttt 960 
tttgttcatt gttgtcaagg ggtttcacga 1020 
tggagagaag ggcaaagttt tgatgatgca 1080 
gctgatccta acatgggctt tgcttgccaa 1140 
ttcccgctta gccctacctc cttacttaga 1200 
gaccctttgc atcttgttcc aaaactgttg 1260 
agaggtgcat ttatcattca gttaccttct 1320 
gaaaccatca tggagaaaga tgcaaaagct 1380 
gtcgaagcac ctattatggt ggtcagagaa 1440 
tttgcaagca ttttgcctat gattgggggc 1500 
aaggtcgacg catataatct ggattttgag 1560 
gtgccaactt tagcatcatc caacaacgaa 1620 
agttggagct cactgaaatg taagtttgca 1680 
tccaaaacgc cactatctag ggtctattca 1740 
tcaccttcct caacaacttc ttcatcatcc 1800 
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actgcgtcgc ctccttttct ctctcccgat tctgtatgtt caacaaattc aggcaatagc 1860 
ttaaagagtt tctctcaatc ctctggacgt tcgtccttga gaccttctat tccaccatcg 1920 
ctgacattgc ctaaattttc cagcctatcc ctcctccctt cccaaacttc tcctaaagaa 1980 
tctcgtggtg tcaatacttt tcttcaaccg tcaccaaata gaaaggcttc accttctctt 2040 
gctgagcgta gaggcagcct gaaaggatct ctgaagttgc caggtttggc tgattccaac 2100 
agaggcacac ctgcttttac tttacatceg gatgatagta atgacatagt cttcaatctg 2160 
gagggtatta gaaacggcga tctatatcca ccaagtgatt gcaaagggac aagtgtagat 2220 
tcagatttgc cagagaagga aattatatcc ttaatcagrtt gcagtaaatc tgacagacac 2280 
aaatcgggag gtgatactga tagctctggc cagcctttag catgtcgttg gccaagtatg 2340 
gagatgatta caaaactgag cagagcttac ttagattcag aatctgttat agcaatcccg 2400 
ttgccaagtg atgctgtagg agaaacgggt agtaggaatt tgtacatttg gatcggaaag 2460 
tcattctctt tggataacaa ctgttctcta gtagatagca acaaagcggc agacactgtg 2520 
gagaatgttg attgggtaca aattggtgaa t.ccattttgt gtcagatgga cttgccaaaa 2580 
gataccccta taaagatagt tagggaatct gaggatcaga cagagttgct agcactgctg 2640 
agcgcgctat aacacccacc cgcaagctct acacatttac tctgtttttt tttcacagat 2700 
tccttcaacc gcaacacttt tccattttca gacagagtat tcattcagct caggactctc 2760 
tgtgcttatg gaagttgatg ttgggggaaa caactctctt gtacagtggg gaaaaaactt 2820 
cttcttcttc tttctatcac atgaaaatcc tcaagggcca ttattagtat gatcagatta 2880 
taaaattgta aggttagggg ctttatgagg attttgatgg acttgttaca atgtttacat 2940 
atacactcag cagcacaata gatttttgtt aaacttacat gttattcaag taaaagtact 3000 
atgtagatgt tgaagtctaa ttgaagaatt agttaatgat aaaaaaaaaa aaaaaaaaa 3059 

<210> 3 
<211> 784 
<212> FRT 

<213> Arabidopsis thaliana 
<400> 3 

Met Val Gly Arg Glu Asp Ala Met Gly Asn Asp Glu Ala Pro Pro Gly 
1 5 10 15 

Ser Lys Lys Met Phe Trp Arg Ser Ala Ser Trp Ser Ala Ser Arg Thr 
20 25 30 

Ala Ser Gin Val Pro Glu Gly Asp Glu Gin Ser Leu Asn lie Pro Cys 
35 40 45 

Ala lie Ser Ser Gly Pro Ser Arg Arg Cys Pro Ala Ala Pro Leu Thr 

50 55 60 ... 

Pro Arg Ser His His Asn Ser Lys Ala Arg Ala Cys Leu Pro Pro Leu , 
65 70 75 80 

Gin Pro Leu. Ala lie Ser Arg Arg Ser Leu Asp Glu Trp Pro Lys Ala 
85 90 95 

Gly Ser Asp Asp Val Gly Glu Trp Pro His Pro Pro Thr Pro Ser Gly 
100 105 110 

Asn Lys Thr Gly Glu Arg Leu Lys Leu Asp Leu Ser Ser Thr Gin Gin 
115 120 125 

Arg Val Thr Asp Lys Ser Ser Gly Leu Ala Lys Arg Glu Lys lie Ala 
130 135 140 
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Phe Phe Asp Lys Glu Cys Ser Lys Val Ala Asp His lie Tyr Val Gly 
145 150 155 160 

Gly Asp Ala Val Ala Lys Asp Lys Ser lie Leu Lys Asn Asn Gly He 
165 170 175 

Thr His He Leu Asn Cys Val Gly Phe He Cys Pro Glu Tyr Phe Lys 
180 185 190 

Ser Asp Phe Cys Tyr Arg Ser Leu Trp Leu Gin Asp Ser Pro Ser Glu 
195 200 205 

Asp He Thr Ser He Leu Tyr Asp Val Phe Asp Tyr Phe Glu Asp Val 
210 215 220 

Arg Glu Gin Ser Gly Arg He Phe Val His Cys Cys Gin Gly Val Ser 
225 230 235 240 

Arg Ser Thr Ser Leu Val He Ala Tyr Leu Met Trp Arg Glu Gly Gin 
245 250 255 

Ser Phe Asp Asp Ala Phe Gin Tyr Val Lys Ser Ala Arg Gly He Ala 
260 265 270 

Asp Pro Asn Met Gly Phe Ala Cys Gin Leu Leu Gin Cys Gin Lys Arg 
275 280 285 

Val His Ala Phe Pro Leu Ser Pro Thr Ser Leu Leu Arg Met Tyr Lys 
290 295 300 

Met Ser Pro His Ser Pro Tyr Asp Pro Leu His Leu Val Pro Lys Leu 
305 310 315 320 

Leu Asn Asp Pro Cys Pro Gly Ser Leu Asp Ser Arg Gly Ala Phe He 
325 330 335 

He Gin Leu Pro Ser Ala He Tyr He Trp Val Gly Arg Gin Cys Glu 
340 345 350 

Thr He Met. Glu Lys Asp Ala Lys Ala Ala Val Cys Gin He Ala Arg 
355 360 365 

Tyr Glu Lys Val Glu Ala Pro He Met Val Val Arg Glu Gly Asp Glu 
37C 375 380 

Pro Val Tyr Tyr Trp Asp Ala Phe Ala Ser He Leu Pro Met He Gly 
385 390 395 400 

Gly Ser Val He Lys Val Gin Pro Gly Asp Arg Lys Val Asp Ala Tyr 
405 410 415 

Asn Leu Asp Phe Glu He Phe Gin Lys Ala He Glu Gly Gly Phe Val 
420 425 430 

Pro Thr Leu Ala Ser Ser Asn Asn Glu His Glu Thr His Leu Pro Ala 
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435 



440 



445 



Arg Glu Asn Ser Trp Ser Ser Leu Lys Cys Lys Phe Ala Ser Arg Phe 
450 455 460 

Asp Lys Gly Phe Arg Tyr Val Ser Lys Thr Pro Leu Ser Arg Val Tyr 
465 470 475 480 

Ser Asp Ser Met Met lie Val His Ser Ser Gly Ser Pro Ser Ser Thr 
485 . 490 495 

Thr Ser Ser Ser Ser Thr Ala Ser Pro Pro Phe Leu Ser Pro Asp Ser 
500 . .505. 510 

Val Cys Ser Thr Asn Ser Gly Asn Ser Leu Lys Ser Phe Ser Gin Ser 
515 520 525 

Ser Gly Arg Ser Ser Leu Arg Pro Ser lie Pro Pro Ser Leu Thr Leu 
530 535 540 

Pro Lys Phe Ser Ser Leu Ser Leu Leu Pro Ser Gin Thr Ser Pro Lys 
545 550 555 560 

Glu Ser Arg Gly Val Asn Thr Phe Leu Gin Pro Ser Pro Asn Arg Lys 
565 570 575 

Ala Ser Pro Ser Leu Ala Glu Arg Arg Gly Ser Leu Lys Gly Ser Leu 
580 585 590 

Lys Leu Pro Gly Leu Ala Asp Ser Asn Arg Gly Thr Pro Ala Phe Thr 
595 600 605 

Leu His Pro Asp Asp Ser Asn Asp lie Val Phe Asn Leu Glu Gly lie 
610 615 620 

Arg Asn Gly Asp Leu Tyr Pro Pro Ser Asp Cys Lys Gly Thr Ser Val 
625 630 635 ■ 640 

Asp Ser Asp Leu Pro Glu Lys Glu lie lie Ser Leu lie Ser Cys Ser 
645 650 655 

Lys Ser Asp Arg His Lys Ser Gly Gly Asp Thr Asp Ser Ser Gly Gin 
660 665 670 

Pro Leu Ala Cys Arg Trp Pro Ser Met Glu Met lie Thr Lys Leu Ser 
675 680 685 

Arg Ala Tyr Leu Asp Ser Glu Ser Val lie Ala lie Pro Leu Pro Ser 
690 695 700 

Asp Ala Val Gly Glu Thr Gly Ser Arg Asn Leu Tyr lie Trp lie Gly 
705 710 715 720 

Lys Ser Phe Ser Leu Asp Asn Asn Cys Ser Leu Val Asp Ser Asn Lys 



725 



730 



735 
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Ala Ala Asp Thr Val Glu Asn Val Asp Trp Val Gin lie Gly Glu Ser 
740 745 750 

lie Leu Cys Gin Met Asp Leu Pro Lys Asp Thr Pro lie Lys lie Val 
755 760 765 

Arg Glu Ser Glu Asp Gin Thr Glu Leu Leu Ala Leu Leu Ser Ala Leu 
770 775 780 



<210> 4 
<211> 10 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 4 

Val His Cys Cys Gin Gly Val Ser Arg Ser 
15 10 



<210> 5 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: mammalian 
sequence motif defining the family of PTPs 

<400> 5 

lie His Cys Xaa Ala Gly Xaa Xaa Arg Ser 
15 10 



<210> 6 
<211> 14 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 6 

Thr Ser lie Leu Tyr Asp Val Phe Asp Tyr Phe Glu Asp Val 
15 10 



<210> 7 
<211> 12 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 7 

Phe Val His Cys Cys Gin Gly Val Ser Arg Ser Thr 
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1 



10 



<210> 8 
<211> 4 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 8 

Phe Val His Cys 



<210> 9 
<211> 6 
<212> PRT 

<213> Arabidopsis thaliana 

<400> 9 

Gin Gly Val Ser Arg Ser 
1 5 



<210> 10 
<211> 5 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 10 

Tyr Phe Lys Ser Asp 
1 5 



<210> 11 
<211> 23 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: degenerate 



<210> 12 
<211> 24 
<212> CNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: degenerate 



1 



primer 



<400> 11 

aayaayggna thacncayat hyt 



23 



primer 



<400> 



12 
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grcangcr aancccatrt tngg 24 

. :o> 13 
:> 24 

-I3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: degenerate 
primer 

<400> 13 

ngtccacatn arrtangcda tnac 24 

<210> 14 
<211> 21 
<212> ena 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 14 

ggtttctaca ggacgtaaca t 21 

<210> 15 

<211> 196 

<212> WA 

<213> Sinapis alba. 

<400> 15 

caactgcgtt ggtttcatct gtcctgaata tttcaagtct gatttttgtt accggtcgtt 60 
gtggttacgt gatagtccat cagaggatat aactagcatt ctctacgatg tctttgacta 120 
ctttgaagac gttagggagc aaagtgggag gatctttgtt cactgttgtc aaggcgtttc 180 
acggtctacc tccttg 196 

<210> 16 
<211> 65 
<212> PRT 

<213> Sinapis a]±>a 
<400> 16 

Asn Cys Val Gly Phe lie Cys Pro Glu Tyr Phe Lys Ser Asp Phe Cys 
1 5 10 15 

Tyr Arg Ser Leu Trp Leu Arg Asp Ser Pro Ser Glu Asp lie Thr Ser 
20 25 30 

He Leu Tyr Asp Val Phe Asp Tyr Phe Glu Asp Val Arg Glu Gin Ser 
35 40 45 

Gly Arg He Phe Val His Cys Cys Gin Gly Val Ser Arg Ser Thr Ser 
50 55 60 

Leu 
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65 



<210> 17 
<211> 196 
<212> ENA 

<213> Lycopersicon esculentum 
<400> 17 

aaactgtgta gggtttagtt gtcctgaata ctttaaggat gaccttgtat acaagacact 60 
ttggctgcag gatagcccca ctgaggacat cacaagtatt ctttatgatg tctttgatta 120 
ctttgaagat gttcatgaac aaggtgggag tgtctttgta cactgcttcc agggggtgtc 180 
ccgatcagcc tccttg 196 

<210> 18 
<211> 65 
<212> PRT 

<213> Lycopersicon esculentum 
<400> 18 

Asn Cys Val Gly Phe Ser Cys Pro Glu Tyr Phe Lys Asp Asp Leu Val 
15 10 15 

Tyr Lys Thr Leu Trp Leu Gin Asp Ser Pro Thr Glu Asp lie Thr Ser 
20 25 30 

lie Leu Tyr Asp Val Phe Asp Tyr Phe Glu Asp Val His Glu Gin Gly 
35 40 45 

Gly Ser Val Phe Val His Cys Phe Gin Gly Val Ser Arg Ser Ala Ser 
50 55 60 

Leu 
65 



<210> 19 
<211> 196 
<212> ENA 
<213> Zea mays 

<400> 19 

caactgcatg ggcttcgtct gcccgaagta cttcaagtct gaccttgtct accgcaccct 60 
ctggctgcag gacagcccca ccgaggacac caccagcatc ctttacgacg tgtttgatta 120 
ctttgaggac gtcagggagc agggtggccg cgtgtttgtg cattgctgcc agggggtgtc 180 
gcgctccacg cctctg 196 

<210> 20 
<211> 65 
<212> PRT 
<213> Zea mays 

<400> 20 

Asn Cys Met Gly Phe Val Cys Pro Lys Tyr Phe Lys Ser Asp Leu Val 
15 10 15 
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Tyr Arg Thr Leu Trp Leu Gin Asp Ser Pro Thr Glu Asp Thr Thr Ser 
20 25 30 

lie Leu Tyr Asp Val Phe Asp Tyr Phe Glu Asp Val Arg Glu Gin Gly 
35 40 45 

Gly Arg Val Phe Val His Cys Cys Gin Gly Val Ser Arg Ser Thr Pro 
50 55 60 

Leu 
65 



<210> 21 
<211> 2450 
<212> ENA 
<213> Zea mays 

<400> 21 

ggcgggtcct cccccgccaa gcccggggag 
tcgcaggggc gcaaggacca gatcgccttc 
cacgtctacc tcggcggcga cgccgtcgcc 
atcacccacg tgctcaactg tgtgggcttt 
gtctaccgca ccctctggct gcaggacagc 
gacgtgtttg actacttcga ggacgtcagg 
tgccaggggg tgtcacgctc cacgtcgctg 
cagagcttcg atgacgcctt ccagtttgtc 
atgggctttg catgccagct tctccagtgc 
ccaaattcag tgctcaggat gtaccgcatg 
ttggtgccca aaatgctcaa tgacccatcc 
attgtgcatg ttctctcgtc gctctatzgtc 
gaaaaggatg caaaggctgc tgcgtttcag 
atcaaggttg tgagagaagg tctggagccg 
ccacctaatt cagatagtaa tacaaagatt 
agtgacccag gaagccggaa aaatgagtcc 
gcaatcactg ggggagtggt ccctgcattt 
cttccagcta gagaaagtag ctggagttta 
gctcgtgttt attcagattc tgctctaatg 
gctgctgagg catcaacctc acctcctttc 
agtgtcagct cgaagtatag ttcagactca 
ccatcatttg gcctctcgcc tgcttcatct 
aggtctcccc tttctcaatc atctaatgaa 
atacactctc cttccaagac ctcttctata 
aagctaccat ctctccaaaa ggatcttgta 
ccaccaaggg cgccatctag tattcgcagg 
ggggttaaac agctgactag tgagttttgc 
tcgcattctg aaactagatt aattgagcgt 
gcacaacttg tagtctacca gtggcccagc 
gatcttgacc cgaagtcggt tttaattttt 
gcagttaaaa tggtgtatgt atgggtagga 
tccgtcgatt ggcaacaggt cactagtgat 
cttcctgtca aggttttcaa ggagcatgaa 
agttaacatt aggcagtagc tatcaggata 



gggctccgcc tcgacctctc ctcgctccgg 60 
ttcgacaagg agtgctccaa ggtcgccgac 120 
aagaaccgcg acatcctcag gaagaacggc 180 
gtctgcccgg agtacttcaa gtcggaccta 240 
cccaccgagg acatcaccag catcctgtac 300 
gagcagggcg gccgcgtgct tgtgcattgc 360 
gtcatcgcct acttgatgtg gagggaaggc 420 
aaggctgccc gggggatcgc aaatccaaac 480 
cagaagcgtg tgcatgcgat tccgctgtca 540 
gcgcctcact cccagtatgc ccctctgcat 600 
ccagccaccc ttgactctag aggcgcgttc 660 
tgggttggaa tgaagtgtga tccggtaatg 720 
gtagtgaggt atgagaaggt gcaggggcac 780 
caggagttct gggatgcatt ttcaagtatg 840 
agcaaggacc agatcgattc agcatccaag 900 
tatgacgctg attttgagct tgtctacaaa 960 
tcaacttctg gggctggtga tgagacccat 1020 
ctgaggcaca agtttatctc caggtcgcta 1080 
aaggattttg atccacgggt acaacacctg 1140 
ctttctccaa gctccttatc atcggattca 1200 
ccctccttat cacctacaac tggctctcca 1260 
aatctgacac atgctttggt gccatcatcc 1320 
ggagcttcaa agccttctgg catggaatca 1380 
gcagaaagga gaggaggctt cacacttcta 1440 1 
ttgccaccaa gggtgccgtc tattgtattg 1500 
accgaggatg cctcagataa tagtacaaat 1560 
tcagaaaaat gcactggtaa tagtttgagc 1620 
actgacagta actcagaagt ctgcagtaat 1680 
atgggaaagc taactacatt tgcacgcaag 1740 
gttacttcga atgccatcag gagaggagaa 1800 
ggcgaaaatg agagcagcaa gagtgttgac 1860 
tttcttcatc taaagggcct cagcaatgtt 1920 
gctgagaatc ttttggaact actgaatgtt 1980 
attgtagttg ctaaacaaac tcaacgaagg 2040 
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catgccctcc agcatcagtc ggtaccgatg attgtcagcg aggtataaag ccacagccat 2100 
tcccttgaac ataataagct acaaacagat tccgttctgc aactgcgcct catgatctat 2160 
attttgtcca gatggcagga ggctgccatg ggcgttgtat cggttgcgaa ttagcactcg 2220 
tggtgtagga gcaatcggcc gattcggtgt atattatccg ctcccctgta atgtaagctc 2280 
agatactggg agctggtgtg tcgacagtta ctttttagcc taaacattct tgtacatctt 2340 
tgaaaggaac agagttgtaa tccttttgac tatgtaaatg gctccattgg tcataacttc 2400 
taaaccatgt ggaactcagt tgctagctgg taaaaaaaaa aaaaaaaaaa 2450 

<210> 22 
<211> 661 
<212> PRT 
<213> Zea mays 

<400> 22 

Gly Gly Ser Ser Pro Ala Lys Pro Gly Glu Gly Leu Arg Leu Asp Leu 
15 10 15 

Ser Ser Leu Arg Ser Gin Gly Arg Lys Asp Gin lie Ala Phe Phe Asp 
20 25 30 

Lys Glu Cys Ser Lys Val Ala Asp His Val Tyr Leu Gly Gly Asp Ala 
35 40 45 

Val Ala Lys Asn Arg Asp lie Leu Arg Lys Asn Gly lie Thr His Val 
50 55 60 

Leu Asn Cys Val Gly Phe Val Cys Pro Glu Tyr Phe Lys Ser Asp Leu 
65 70 75 80 

Val Tyr Arg Thr Leu Trp Leu Gin Asp Ser Pro Thr Glu Asp lie Thr 
85 90 95 

Ser lie Leu Tyr Asp Val Phe Asp Tyr Phe Glu Asp Val Arg Glu Gin 
100 105 110 

Gly Gly Arg Val Leu Val His Cys Cys Gin Gly Val Ser Arg Ser Thr 
115 120 125 

Ser Leu Val lie Ala Tyr Leu Met Trp Arg Glu Gly Gin Ser Phe Asp . - 
.130. 135 140 

Asp Ala Phe Gin Phe Val Lys Ala Ala Arg Gly lie Ala Asn Pro Asn 
145 150 155 160 

Met Gly Phe Ala Cys Gin Leu Leu Gin Cys Gin Lys Arg Val His Ala 
165 170 175 

lie Pro Leu Ser Pro Asn Ser Val Leu Arg Met Tyr Arg Met Ala Pro 
180 185 190 

His Ser Gin Tyr Ala Pro Leu His Leu Val Pro Lys Met Leu Asn Asp 
195 200 205 

Pro Ser Pro Ala Thr Leu Asp Ser Arg Gly Ala Phe lie Val His Val 
210 215 220 
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Leu Ser Ser Lieu Tyr Val Trp Val Gly Met Lys Cys Asp Pro Val Met 
225 230 235 240 

Glu Lys Asp Ala Lys Ala Ala Ala Phe Gin Val Val Arg Tyr Glu Lys 
245 250 255 

Val Gin Gly His lie Lys Val Val Arg Glu Gly Leu Glu Pro Gin Glu 
260 265 270 

Phe Trp Asp Ala Phe Ser Ser Met Pro Pro Asn Ser Asp Ser Asn Thr 
275 280 285 

Lys lie Ser Lys Asp Gin lie Asp Ser Ala Ser Lys Ser Asp Pro Gly 
290 295 300 

Ser Arg Lys Asn Glu Ser Tyr Asp Ala Asp Phe Glu Leu Val Tyr Lys 
305 310 315 320 

Ala He Thr Gly Gly Val Val Pro Ala Phe Ser Thr Ser Gly Ala Gly 
325 330 335 

Asp Glu Thr His Leu Pro Ala Arg Glu Ser Ser Trp Ser Leu Leu Arg 
340 345 350 

His Lys Phe He Ser Arg Ser Leu Ala Arg Val Tyr Ser Asp Ser Ala 
355 360 365 

Leu Met Lys Asp Phe Asp Pro Arg Val Gin His Leu Ala Ala Glu Ala 
370 375 380 

Ser Thr Ser Pro Pro Phe Leu Ser Pro Ser Ser Leu Ser Ser Asp Ser 
385 390 395 400 

Ser Val Ser Ser Lys Tyr Ser Ser Asp Ser Pro Ser Leu Ser Pro Thr 
405 410 415 

Thr Gly Ser Pro Pro Ser Phe Gly Leu Ser Pro Ala Ser Ser Asn Leu 
420 425 430 

Thr His Ala Leu Val Pro Ser Ser Arg Ser Pro Leu Ser Gin Ser Ser 
435 440 445 

Asn Glu Gly Ala Ser Lys Pro Ser Gly Met Glu Ser He His Ser Pro 
450 455 460 

Ser Lys Thr Ser Ser He Ala Glu Arg Arg Gly Gly Phe Thr Leu Leu 
465 470 475 480 

Lys Leu Pro Ser Leu Gin Lys Asp Leu Val Leu Pro Pro Arg Val Pro 
485 490 495 

Ser He Val Leu Pro Pro Arg Ala Pro Ser Ser He Arg Arg Thr Glu 
500 505 510 
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Asp Ala Ser Asp Asn Ser Thr Asn Gly Val Lys Gin Leu Thr Ser Glu 
515 520 525 

Phe Cys Ser Glu Lys Cys Thr Gly Asn Ser Leu Ser Ser His Ser Glu 
530 535 540 

Thr Arg Leu lie Glu Arg Thr Asp Ser Asn Ser Glu Val Cys Ser Asn 
545 550 555 560 

Ala Gin Leu Val Val Tyr Gin Trp Pro Ser Met Gly Lys Leu Thr Thr 
565 570 575 

Phe Ala Arg Lys Asp Leu Asp Pro Lys Ser Val Leu lie Phe Val Thr 
580 585 590 

Ser Asn Ala lie Arg Arg Gly Glu Ala Val Lys Met Val Tyr Val Trp 
595 600 605 

Val Gly Gly Glu Asn Glu Ser Ser Lys Ser Val Asp Ser Val Asp Trp 
610 615 620 

Gin Gin Val Thr Ser Asp Phe Leu His Leu Lys Gly Leu Ser Asn Val 
625 630 635 640 

Leu Pro Val Lys Val Phe Lys Glu His Glu Ala Glu Asn Leu Leu Glu 
645 650 655 . 

Leu Leu Asn Val Ser 
660 



<210> 23 
<211> 522 
<212> mm 

<213> Lycopersicon esculentum 
<400> 23 

aactgtgtgg ggtttgtatg cccagagtat ttcaagtctg atttcgtata ccggactttg 60 
tggttgcagg acagcccatc agaagatatt actagtattc tctatgatgt ttttgactac 120 
tttgaagatg teagggagca acatgggaag gtttttgttc attgctgcca aggggtctct 180 
cggtcaacct cgrtggttat tgcttatcgt atgtggagag aaggacaaag ttttgatgat 240 
gcctttgagc atgtaaaggc agcaaggggt attgcggatc caaatatggg ttttgcttgt 300 
cagttattac aatgccaaaa aagggttcat gcttctcctt tgagcccaag ttcattatta 360 
aggatgtaca gagmrgcacc tcattcacca tacgatcctt tgcatctcgt cccaaaaatg 420 
ttaaatgatc cctcaccggc agcattagat tctagaggtg catttattat acacatacct 480 
tcatcggtat atgtatggat tggtaagaaa tgtgaagcaa tc 522 

<210> 24 
<211> 174 
<212> PRT 

<213> Lycopersicon esculentum 
<400> 24 

Asn Cys Val Gly Phe Val Cys Pro Glu Tyr Phe Lys Ser Asp Phe Val 
15 10 15 
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Tyr Arg Thr Leu Trp Leu Gin Asp Ser Pro Ser Glu Asp lie Thr Ser 
20 25 30 

lie Leu Tyr Asp Val Phe Asp Tyr Phe Glu Asp Val Arg Glu Gin His 
35 40 45 

Gly Lys Val Phe Val His Cys Cys Gin Gly Val Ser Arg Ser Thr Ser 
50 55 60 

Leu Val lie Ala Tyr Arg Met Trp Arg Glu Gly Gin Ser Phe Asp Asp 
65 70 75 80 

Ala Phe Glu Tyr Val Lys Ala Ala Arg Gly lie Ala Asp Pro Asn Met 
85 90 95 

Gly Phe Ala Cys Gin Leu Leu Gin Cys Gin Lys Arg Val His Ala Ser 
100 105 110 

Pro Leu Ser Pro Ser Ser Leu Leu Arg Met Tyr Arg Val Ala Pro His 
115 120 125 

Ser Pro Tyr Asp Pro Leu His Leu Val Pro Lys Met Leu Asn Asp Pro 
130 135 140 

Ser Pro Ala Ala Leu Asp Ser Arg Gly Ala Phe lie lie His lie Pro 
145 150 155 160 

Ser Ser Val Tyr Val Trp lie Gly Lys Lys Cys Glu Ala lie 
165 170 



<210> 25 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: degenerate 
primer 

<400> 25 

gcngcyttng crtcyttytc c 21 

<210> 26 
<211> 24 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: degenerate 
primer 

<400> 26 

ytcnckngcn ggnarrtgng tytc 24 
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